Induction of Rules for Biological Macromolecule Crystallization
نویسندگان
چکیده
X-ray crystallography is the method of choice for determlning the 3-D structure of large macromolecules at a high enough resolution. The rate limiting step in structure determination is the crystallization itself. It takes anywhere between a few weeks to several years to obtain macromolecuiar crystals that yield good diffraction patterns. The theory of forces that promote and maintain crystal growth is pre|im;nary, and crystallog~aphers systematically search a large parameter space of experimental settings to grow good crystals. There is a wealth of experimental data on crystal growth most of which is in paper laboratory notebooks. Some of the data ha~ been gathered in electronic form, e.g., the Biological Macromoleculur Crysta]l;~ation Database (BMCD) which is a repository of successful experimental conditions for grow° Lug over 800 different macromoiecules (GUl~and 1987). Crystallographers are in need of computational tools to gather and analyze past data to des~n new crystal growth trails. We are building the Crystallographer’s Assistant (CA) to help crystallographers record and maintain experimental context in electronic form, offer suggestions on experimental conditions that are likely to be successful, and provide explanations for failed experiments. As an initial step in this project, we have applied B_L, an inductive learning program, to the BMCD. In this paper we report initial experiments and findings in applying RL to the BMCD. From the point of view of crystallography, we have discovered possibly significant new empirical relationships in crystal growth. From the point of view of machine learning, our work suggests refinements of existing methods for incorporating detailed domain knowledge into inductive analysis techniques.
منابع مشابه
Development of Crystallization Strategies Using the Biological Macromolecule Crystallization Database
The NIST/NASA/CARB Biological Macromolecule studies requires not only finding suitable chemical agents Crystallization Database (BMCD) contains crystal data that induce and sustain crystal growth but also that those and crystallization conditions for biological parameters such as protein concentration, ionic strength, macromolecules abstracted from the literature. Each temperature, pH, etc., be...
متن کاملThe Biological Macromolecule Crystallization Database and NASA Protein Crystal Growth Archive
The NIST/NASA/CARB Biological Macromolecule Crystallization Database (BMCD), NIST Standard Reference Database 21, contains crystal data and crystallization conditions for biological macromolecules. The database entries include data abstracted from published crystallographic reports. Each entry consists of information describing the biological macromolecule crystallized and crystal data and the ...
متن کاملStatistical methods for the objective design of screening procedures for macromolecular crystallization.
The crystallization of a new macromolecule is still very much a trial-and-error process. As is well known, it requires the search of a large parameter space of experimental settings to find the relatively few idiosyncratic conditions that lead to diffraction-quality crystals. Crystallographers have developed a variety of screens to help identify initial crystallization conditions, including tho...
متن کاملTapping the Protein Data Bank for crystallization information.
A database application has been developed for the collection of crystallographic information. This database (the BDP) has been populated with the information found in the Protein Data Bank (PDB). The tool has been used to store crystallization data parsed out of the PDB and these data may be used to extend the crystallization information found in the Biological Macromolecule Crystallization Dat...
متن کاملCorrelation between Protein Sequence Similarity and Crystallization Reagents in the Biological Macromolecule Crystallization Database
The protein structural entries grew far slower than the sequence entries. This is partly due to the bottleneck in obtaining diffraction quality protein crystals for structural determination using X-ray crystallography. The first step to achieve protein crystallization is to find out suitable chemical reagents. However, it is not an easy task. Exhausting trial and error tests of numerous combina...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001